Variance Breakdown of Huber ( M ) - estimators : n / p → m ∈ ( 1 , ∞ )
نویسندگان
چکیده
Huber’s gross-errors contamination model considers the class Fε of all noise distributions F = (1 − ε)Φ + εH, with Φ standard normal, ε ∈ (0, 1) the contamination fraction, and H the contaminating distribution. A half century ago, Huber evaluated the minimax asymptotic variance in scalar location estimation, min ψ max F∈Fε V (ψ, F ) = 1 I(F ∗ ε ) (1) where V (ψ,F ) denotes the asymptotic variance of the (M)-estimator for location with score function ψ, and I(F ∗ ε ) is the minimal Fisher information minFε I(F ). We consider the linear regression model Y = Xθ0 +W , Wi ∼i.i.d. F , and iid Normal predictors Xi,j , working in the high-dimensional-limit asymptotic where the number n of observations and p of variables both grow large, while n/p→ m ∈ (1,∞); hence m plays the role of ‘asymptotic number of observations per parameter estimated’. Let Vm(ψ,F ) denote the per-coordinate asymptotic variance of the (M)-estimator of regression in the n/p → m regime [EKBBL13, DM13, Kar13]. Then Vm 6= V ; however Vm → V as m→∞. In this paper we evaluate the minimax asymptotic variance of the Huber (M)-estimate. The statistician minimizes over the family (ψλ)λ>0 of all tunings of Huber (M)-estimates of regression, and Nature maximizes over gross-error contaminations F ∈ Fε. Suppose that I(F ∗ ε )·m > 1. Then min λ max F∈Fε Vm(ψλ, F ) = 1 I(F ∗ ε )− 1/m . (2) Of course, the RHS of (2) is strictly bigger than the RHS of (1). Strikingly, if I(F ∗ ε ) ·m ≤ 1, then min λ max F∈Fε Vm(ψλ, F ) =∞. In short, the asymptotic variance of the Huber estimator breaks down at a critical ratio of observations per parameter. Classically, for the minimax (M)-estimator of location, no such breakdown occurs [DH83]. However, under this paper’s n/p→ m asymptotic, the breakdown point is where the Fisher information per parameter equals unity: ε∗ ≡ εm(Minimax Huber-(M) Estimate) = inf{ε : m · I(F ∗ ε ) ≥ 1}. ∗Department of Statistics, Stanford University †Department of Electrical Engineering and Department of Statistics, Stanford University
منابع مشابه
Breakdown Points and Variation Exponents of Robust M-estimators in Linear Models
The breakdown point behavior of M-estimators in linear models with xed designs, arising from planned experiments or qualitative factors, is characterized. Particularly, this behavior at xed designs is quite diierent from that at designs which can be corrupted by outliers|the situation prevailing in the literature. For xed designs, the breakdown points of robust M-estimators (those with bounded ...
متن کاملHigher order asymptotics for the MSE of M-estimators on shrinking neighborhoods
In the setup of shrinking neighborhoods about an ideal central model, Rieder (1994) determines the as. linear estimator minimaxing MSE on these neighborhoods. We address the question to which degree this as. optimality carries over to finite sample size. We consider estimation of a one-dim. location parameter by means of M-estimators Sn with monotone influence curve ψ . Using Donoho and Huber (...
متن کاملRobust Methods for Mean and Covariance Structure Analysis
Covariance structure analysis plays an important role in social and behavioral sciences to evaluate hypothesized in uences among unmeasured latent and observed variables. Existing methods for analyzing these data rely on unstructured sample means and covariances estimated under normality, and evaluate a proposed structural model using statistical theory based on normal theory MLE and generalize...
متن کاملOn Mathematical Characteristics of some Improved Estimators of the Mean and Variance Components in Elliptically Contoured Models
In this paper we treat a general form of location model. It is typically assumed that the error term is distributed according to the law belonging to the class of elliptically contoured distribution. Some sorts of shrinkage estimators of location and scale parameters are proposed and their exact bias and MSE expressions are derived. The performance of the estimators under study are compl...
متن کاملOn M-estimators of Approximate Quantiles and Approximate Conditional Quantiles
M-estimators introduced in Huber (1964) provide a class of robust estimators of a center of symmetry of a symmetric probability distribution which also have very high eeciency at the model. However it is not clear what they do estimate when the probability distributions are nonsymmetric. In this paper we rst show that in the case of arbitrary, not necessarily symmetric probabilty distributions,...
متن کامل